Indexing pharmacogenetic knowledge on the World Wide Web.
نویسندگان
چکیده
A key challenge for pharmacogenetics is the creation of databases to store, analyse and disseminate important datasets in order to catalyse research and training. Most successful databases have a limited scope: Genbank contains DNA sequences [1]; the Protein Data Bank contains the three-dimensional coordinates of macromolecules [2]; the Online Mendelian Inheritance in Man contains a record of human genetic disease [3]; and PubMED contains the biomedical literature [4]. This limited scope is a great strength, because the information can be stored, searched and analysed using a few powerful tools, and the users of these databases know exactly what to expect. Databases for pharmacogenetics and pharmacogenomics will have much more diversity. Pharmacogenetic data involve phenotypes that are as diverse as the assays we invent to measure them. Thus, it is unclear what a user should expect from a pharmacogenetics database, and yet a public repository of pharmacogenetic data is critical to establish a core dataset for the field upon which we can build new analyses and new hypotheses [1]. Clearly, successful databases for pharmacogenetics must employ some sort of classification of phenotypes that is general purpose, yet extensible to include undefined characterizations of phenotype.
منابع مشابه
WebWatcher: Knowledge Navigation in the World Wide Web
Many have noted the need for software to assist people in locating information on the World Wide Web. Although effective tools exist, they typically rely on brute-force scanning and indexing of Web pages for later keyword-based retrieval. Such tools ignore at least two sources of knowledge which might prove useful in navigation and retrieval: (1) the structure of the Web as a graph, and (2) the...
متن کاملIndexing The World Wide Web: The Journey So Far
In this chapter, we describe the key indexing components of today’s web search engines. As the World Wide Web has grown, the systems and methods for indexing have changed significantly. We present the data structures used, the features extracted, the infrastructure needed, and the options available for designing a brand new search engine. We highlight techniques that improve relevance of result...
متن کاملSuitability of Signature Indexing Over the World Wide Web
Signature indexing has been studied extensively in text database or other databases for many years. The main advantages of a signature le as an access index are its small size, distributability, the ability to index information of a wide variety of types, ease of maintenance, and the ability to provide fuzzy indexing. These features are precisely what are needed for a good access index for inde...
متن کاملIntegrating Background Knowledge into Nearest-Neighbor Text Classification
This paper describes two different approaches for incorporating background knowledgeinto nearest-neighbor text classification.Our first approachuses backgroundtext to assessthe similarity betweentraining and test documentsrather than assessing their similarity directly. The second method redescribes examples using Latent Semantic Indexing on the background knowledge, assessing document similari...
متن کاملRanking Web Pages Using Collective Knowledge
Indexing is a crucial technique for dealing with the massive amount of data present on the web. Indexing can be performed based on words or on phrases. Our approach aims to efficiently index web documents by employing a hybrid technique in which web documents are indexed in such a way that knowledge available in the Wikipedia and in meta-content is efficiently used. Our preliminary experiments ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Pharmacogenetics
دوره 13 1 شماره
صفحات -
تاریخ انتشار 2003